Regularized Mapping to Latent Structures and Its Application to Web Search

نویسندگان

  • Wei Wu
  • Zhengdong Lu
  • Hang Li
چکیده

Projection to Latent Structures (PLS), also known as Partial Least Squares, is a method for matching objects from two heterogeneous domains. Although PLS is empirically verified effective for matching queries and documents, its scalability becomes a major hurdle for its application in real-world web search. In this paper, we study a general framework for matching heterogeneous objects, which renders a rich family of matching models when different regularization are enforced, with PLS as a special case. Particularly, with l1 and l2 type of regularization on the mapping functions, we obtain the model called Regularized Mapping to Latent Structures (RMLS). RMLS enjoys many advantages over PLS, including lower time complexity and easy parallelization. As another contribution, we give a generalization analysis of this matching framework, and apply it to both PLS and RMLS. In experiments, we compare the effectiveness and efficiency of RMLS and PLS on large scale web search problems. The results show that RMLS can achieve equally good performance as PLS for relevance ranking, while significantly speeding up the learning process.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics

This paper discusses about the future of the World Wide Web development, called Semantic Web. Undoubtedly, Web service is one of the most important services on the Internet, which has had the greatest impact on the generalization of the Internet in human societies. Internet penetration has been an effective factor in growth of the volume of information on the Web. The massive growth of informat...

متن کامل

Query expansion based on relevance feedback and latent semantic analysis

Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...

متن کامل

PARTICLE SWARM-GROUP SEARCH ALGORITHM AND ITS APPLICATION TO SPATIAL STRUCTURAL DESIGN WITH DISCRETE VARIABLES

Based on introducing two optimization algorithms, group search optimization (GSO) algorithm and particle swarm optimization (PSO) algorithm, a new hybrid optimization algorithm which named particle swarm-group search optimization (PS-GSO) algorithm is presented and its application to optimal structural design is analyzed. The PS-GSO is used to investigate the spatial truss structures with discr...

متن کامل

Learning bilinear model for matching queries and documents

The task of matching data from two heterogeneous domains naturally arises in various areas such as web search, collaborative filtering, and drug design. In web search, existing work has designed relevance models to match queries and documents by exploiting either user clicks or content of queries and documents. To the best of our knowledge, however, there has been little work on principled appr...

متن کامل

Mappings to Realcompactifications

In this paper, we introduce and study  a mapping from the collection of all  intermediate rings of $C(X)$ to the collection of all  realcompactifications of $X$ contained in $beta X$. By establishing the relations between this mapping and its converse,  we give a different approach to the main statements of De et. al. Using these, we provide different answers to the   four basic questions...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012